NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Accelerating NCE Convergence with Adaptive Normalizing Constant Computation

Chikina, Maria; Koes, David; Sevekari, Anish; Aggarwal, Rishal (July 2024, Open Review)

Noise Contrastive Estimation (NCE) is a widely used method for training generative models, typically used as an alternative to Maximum Likelihood Estimation (MLE) when exact computations of probability are hard. NCE trains generative models by discriminating between data and appropriately chosen noise distributions. Although NCE is statistically consistent, it suffers from slow convergence and high variance when there is small overlap between the noise and data distributions. Both these problems are related to the flatness of the NCE loss landscape. We propose an innovative approach to circumvent slow convergence rates by quick inference of the optimal normalizing constant at every gradient step. This allows the rest of the parameters to have more freedom during NCE optimization. We analyze the use of both binary search and the Bennett Acceptance Ratio (BAR) for quick computation of the normalizing constant and show improved performance for both methods on convex and non-convex settings.
more » « less
Full Text Available
Improving ΔΔG Predictions with a Multitask Convolutional Siamese Network

https://doi.org/10.1021/acs.jcim.1c01497

McNutt, Andrew T.; Koes, David Ryan (April 2022, Journal of Chemical Information and Modeling)

Full Text Available
Evaluation of Thermochemical Machine Learning for Potential Energy Curves and Geometry Optimization

https://doi.org/10.1021/acs.jpca.0c10147

Folmsbee, Dakota L.; Koes, David R.; Hutchison, Geoffrey R. (March 2021, The Journal of Physical Chemistry A)
null (Ed.)
Full Text Available
The 3Dmol.js Learning Environment: A Classroom Response System for 3D Chemical Structures

https://doi.org/10.1021/acs.jchemed.0c00579

Seshadri, Keshavan; Liu, Peng; Koes, David Ryan (October 2020, Journal of Chemical Education)
null (Ed.)
Full Text Available
Systematic Comparison of Experimental Crystallographic Geometries and Gas-Phase Computed Conformers for Torsion Preferences

https://doi.org/10.1021/acs.jcim.3c01278

Folmsbee, Dakota L.; Koes, David R.; Hutchison, Geoffrey R. (November 2023, Journal of Chemical Information and Modeling)
Conformer Generation for Structure-Based Drug Design: How Many and How Good?

https://doi.org/10.1021/acs.jcim.3c01245

McNutt, Andrew T.; Bisiriyu, Fatimah; Song, Sophia; Vyas, Ananya; Hutchison, Geoffrey R.; Koes, David Ryan (October 2023, Journal of Chemical Information and Modeling)
CACHE Challenge #1: Targeting the WDR Domain of LRRK2, A Parkinson’s Disease Associated Protein

https://doi.org/10.1021/acs.jcim.4c01267

Li, Fengling; Ackloo, Suzanne; Arrowsmith, Cheryl H; Ban, Fuqiang; Barden, Christopher J; Beck, Hartmut; Beránek, Jan; Berenger, Francois; Bolotokova, Albina; Bret, Guillaume; et al (November 2024, Journal of Chemical Information and Modeling)

The CACHE challenges are a series of prospective benchmarking exercises to evaluate progress in the field of computational hit-finding. Here we report the results of the inaugural CACHE challenge in which 23 computational teams each selected up to 100 commercially available compounds that they predicted would bind to the WDR domain of the Parkinson’s disease target LRRK2, a domain with no known ligand and only an apo structure in the PDB. The lack of known binding data and presumably low druggability of the target is a challenge to computational hit finding methods. Of the 1955 molecules predicted by participants in Round 1 of the challenge, 73 were found to bind to LRRK2 in an SPR assay with a KD lower than 150 μM. These 73 molecules were advanced to the Round 2 hit expansion phase, where computational teams each selected up to 50 analogs. Binding was observed in two orthogonal assays for seven chemically diverse series, with affinities ranging from 18 to 140 μM. The seven successful computational workflows varied in their screening strategies and techniques. Three used molecular dynamics to produce a conformational ensemble of the targeted site, three included a fragment docking step, three implemented a generative design strategy and five used one or more deep learning steps. CACHE #1 reflects a highly exploratory phase in computational drug design where participants adopted strikingly diverging screening strategies. Machine learning-accelerated methods achieved similar results to brute force (e.g., exhaustive) docking. First-in-class, experimentally confirmed compounds were rare and weakly potent, indicating that recent advances are not sufficient to effectively address challenging targets.
more » « less
Full Text Available

Search for: All records